Square-error Clustering Scheme and Clustering Networks

Authors

  • Byoung-Ho Kang
  • Jang-Hee Yoo
Abstract

The goal of cluster analysis is to separate a set of objects into constituent groups so that the members of any one group differ from one another as little as possible, according to a given criterion [6]. Pal et al. [5] proposed a generalized learning vector quantization (GLVQ) algorithm and compared it with the learning vector quantization (LVQ) algorithm on clustering Anderson's IRIS data [3]. In this paper, the performance of the hard c-means (HCM), fuzzy c-means (FCM), LVQ and GLVQ algorithms is evaluated using the "wine recognition data".

1 Square-error clustering

Let $\mathbb{R}$ be the set of reals (feature space), $\mathbb{R}^p$ the set of p-tuples of reals, $X \subset \mathbb{R}^p$ a finite set with $X = \{x_1, x_2, \ldots, x_n\}$, and $c$ an integer (the number of classes in a c-partition) with $2 \le c \le n$. Every function $u : X \to \{0, 1\}$ (hard membership) or $u : X \to [0, 1]$ (fuzzy membership) is said to assign its grade of membership to each $x \in X$ [4]. Let $v = (v_1, v_2, \ldots, v_c)$ be a c-tuple in which $v_i \in \mathbb{R}^p$ is the cluster center or prototype of class $i$, and let $u_{ik}$ denote the membership of $x_k$ in class $i$. Then the objective functional $J_m$ is defined as

$$J_m(U, v) = \sum_{k=1}^{n} \sum_{i=1}^{c} (u_{ik})^m (d_{ik})^2$$

where $d_{ik}^2 = \| x_k - v_i \|^2$, $m = 1$ for HCM and $1 < m < \infty$ for FCM. The HCM and FCM algorithms produce a hard or fuzzy c-partition of the data set via iterative optimization of $J_m$.

2 Clustering networks

LVQ is not a clustering algorithm per se; rather, it can be used to generate hard c-partitions of unlabeled data sets with the nearest prototype classifier designed with its terminal prototypes [5]. In LVQ one tries to discover cluster structure in unlabeled p-dimensional data. Let $X = \{x_1, x_2, \ldots, x_n\} \subset \mathbb{R}^p$ denote the samples and let $c$ denote the number of nodes in the competitive layer, that is, the number of clusters in $X$. The input layer of the LVQ network is connected directly to the output layer. Each node of the output layer has a prototype (or weight vector) attached to it. For $1 \le i \le c$, the prototypes $V = (v_1, v_2, \ldots, v_c)$ are a network array of unknown cluster centers $v_i \in \mathbb{R}^p$. In this context …
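The objective functional $J_m$ and the nearest prototype rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`sq_dist`, `objective`, `nearest_prototype_partition`), the toy data, and the choice $m = 2$ for the fuzzy case are all assumptions made for illustration, and the membership matrix is stored as `U[i][k]` for $u_{ik}$.

```python
from math import fsum


def sq_dist(x, v):
    """Squared Euclidean distance ||x - v||^2 between two p-dimensional points."""
    return fsum((a - b) ** 2 for a, b in zip(x, v))


def objective(U, X, V, m=1.0):
    """J_m(U, v) = sum over k and i of (u_ik)^m * d_ik^2, with U[i][k] = u_ik."""
    return fsum(
        (U[i][k] ** m) * sq_dist(X[k], V[i])
        for k in range(len(X))
        for i in range(len(V))
    )


def nearest_prototype_partition(X, V):
    """Hard c-partition: u_ik = 1 for the prototype closest to x_k, else 0."""
    U = [[0.0] * len(X) for _ in V]
    for k, x in enumerate(X):
        winner = min(range(len(V)), key=lambda i: sq_dist(x, V[i]))
        U[winner][k] = 1.0
    return U


# Toy data, made up for illustration: n = 4 points in R^2, c = 2 prototypes.
X = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
V = [(0.0, 1.0), (10.0, 1.0)]

# Hard memberships from the nearest prototype rule, evaluated with m = 1 (HCM).
U_hard = nearest_prototype_partition(X, V)
J_hard = objective(U_hard, X, V, m=1.0)

# Uniform fuzzy memberships (0.5 per class), evaluated with m = 2 (an assumed
# value inside the FCM range 1 < m < infinity).
U_fuzzy = [[0.5] * len(X) for _ in V]
J_fuzzy = objective(U_fuzzy, X, V, m=2.0)

print(J_hard, J_fuzzy)
```

The sketch only evaluates $J_m$ for fixed memberships and prototypes; the iterative optimization performed by HCM and FCM, and the LVQ prototype updates, are not shown because the excerpt does not give their update rules.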


Similar resources

A novel local search method for microaggregation

In this paper, we propose an effective microaggregation algorithm to produce more useful protected data for publishing. Microaggregation is mapped to a clustering problem with known minimum and maximum group size constraints. In this scheme, the goal is to cluster n records into groups of at least k and at most 2k-1 records, such that the sum of the within-group squ...


Optimization and design of Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and Fuzzy C-Means Clustering to predict the scour after bucket spillway

Additionally, if the materials downstream of a bucket spillway are erodible, the ogee spillway is likely to overturn over time. Therefore, predicting the scour after a bucket spillway is important. In this study, the scour depths downstream of a bucket spillway are modeled using a new meta-heuristic model. This model is developed by combination of the Adaptive Neuro-Fuzzy Infere...


Prediction-Based Portfolio Optimization Model for Iran’s Oil Dependent Stocks Using Data Mining Methods

This study applied a prediction-based portfolio optimization model to explore the results of portfolio predicament in the Tehran Stock Exchange. To this aim, first, the data mining approach was used to predict the petroleum products and chemical industry using clustering stock market data. Then, some effective factors, such as crude oil price, exchange rate, global interest rate, gold price, an...


Combination of Transformed-means Clustering and Neural Networks for Short-Term Solar Radiation Forecasting

In order to provide an efficient conversion and utilization of solar power, solar radiation data should be measured continuously and accurately over the long-term period. However, the measurement of solar radiation is not available to all countries in the world due to some technical and fiscal limitations. Hence, several studies were proposed in the literature to find mathematical and physical mod...


A simple D²-sampling based PTAS for k-means and other Clustering problems

Given a set of points $P \subset \mathbb{R}^d$, the k-means clustering problem is to find a set of k centers $C = \{c_1, \ldots, c_k\}$, $c_i \in \mathbb{R}^d$, such that the objective function $\sum_{x \in P} d(x, C)^2$, where $d(x, C)$ denotes the distance between x and the closest center in C, is minimized. This is one of the most prominent objective functions that have been studied with respect to clustering. D²-sampling [7] is a simple non-uniform s...




Publication date: 2007